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© A method, and apparatus for accomplishing 
same, of providing character prototypes for use by a 
handwriting character recognition system 10. A first 
method includes a first step of providing a first 
handwritten character; a second step of copying the 
first character to provide a second character; and a 
third step of modifying the second character to (a) 
emphasize dissimilarities between the second char- 
acter and the first character and to (b) de-emphasize 
similarities between the second character and the 
first character. A second method includes a first step 
of providing a first handwritten character and a sec- 
ond handwritten character, the first and the second 
handwritten characters each being comprised of a 
plurality of points. A second step of the method 
matches corresponding points between the first 
character and the second character. A third step of 
the method processes pairs of matching points to 
separate one. from another each point of the match- 



ing pair by an amount proportional to an initial sepa- 
ration between each matched corresponding point, 
so as to emphasize dissimilarities between the first 
character and the second character. 
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This invention relates generally to handwriting 
recognition systems and, in particular, to methods 
and apparatus for optimizing a set of character 
prototypes used for matching against an unknown 
input character. 

Many on-line, handwriting recognition systems 
employ curve matching methods to match an un- 
known character against prototype, or template, 
characters. Examples of such systems are de- 
scribed in the following articles: W. Doster and R. 
Oed, "Word processing with on-line script recogni- 
tion," IEEE Micro., vol. 4, pp. 36-43, October 1984; 
K. Ikeda, T. Yamamura, Y. Mitamura, S. Fujiwara, 
Y. Tominaga, and T. Kiyono, "On-line recognition 
of handwritten characters utilizing positional and 
stroke vector sequences," Proc. 4th Int. Jt. Conf. 
Pattern Recognition, pp. 813-815, November 1978; 
CO Tappert, "Adaptive on-line handwriting rec- 
ognition," Proc. 7th Int. Conf. Pattern Recognition, 
pp. 1004-1007, 1984; CO. Tappert, "Speed, accu- 
racy, flexibility trade-offs in on-line character rec- 
ognition," IBM Research Report RC13228, October 
1987; and T. Wakahara and M. Umeda, "Stroke- 
number and stroke-order free on-line character rec- 
ognition by selective stroke linkage method," Proc. 
4th IOTP, pp. 157-162, 1983. In general, the rec- 
ognition accuracy of such prototype-based hand- 
writing recognition systems is a function of the 
quality of the prototypes. 

One problem encountered in on-line handwrit- 
ing recognition is the discrimination of similarly- 
shaped handwritten characters, such as A-H, h-k, 
g-y, etc. In that known curve matching methods 
weigh all portions of the curves of characters 
equally, non-essential portions may cause errors. In 
particular, with curve matching methods an unim- 
portant feature or portion of the character, such as 
the shape of the descender or tail of "g" and "y". 
may cause recognition errors. 

The following U.S. and Japanese Patents all 
teach various aspects of handwriting recognition 
systems. 

US-A-4 561 105, issued December 24, 1985, 
entitled "Complex Pattern Recognition Method and 
System" to Crane et al. describes an on-line char- 
acter recognition system that uses template or 
model strokes (col. 3, lines 1-52). 

US-A-4 573 196, issued February 25, 1986, 
entitled "Confusion Grouping of Strokes in Pattern 
Recognition Method and System" to Crane et al. 
also describes an on-line character recognition sys- 
tem that uses template or model strokes and which 
further provides a distance metric to categorize a 
stroke as a member of a "confusion group". 

US-A-4 718 102, issued January 5, 1988, en- 
titled "Process and Apparatus Involving Pattern 
Recognition" to Crane et al. describes a. hand- 
written character recognition system having disam- 



biguation routines to differentiate members of con- 
fusion sets of characters (col. 4). 

US-A-4 754 489, issued June 28, 1988, entitled 
"Means for Resolving Ambiguities in Text Based 

5 Upon Character Context" to Bokser describes a 
context algorithm that assigns a probability value to 
each member of a set of character strings in order 
to determine which character string in a set has a 
highest probability. 

io Commonly assigned US-A-4 731 857, issued 

March 15, 1988, entitled "Recognition System for 
Run-on Handwritten Characters" to C.C. Tappert 
describes a segmentation method for run-on hand- 
written characters that considers all stroke ends as 

75 possible character segmentation points. 

US-A-4 491 965, issued January 1, 1985, en- 
titled "Character Recognition Apparatus" to 
Yoshimura describes character recognition appara- 
tus that provides normalized character patterns to a 

20 recognizing section which compares the normal- 
ized pattern with entries in a dictionary. 

US-A-4 654 873, issued March 31, 1987, en- 
titled "System and Method for Segmentation and 
Recognition of Patterns" to Fujisawa et al. de- 

25 scribes a character recognition system for use in 
an optical character reader in which predetermined 
unit patterns are extracted from a two-dimensional 
image pattern. If an ambiguity exists in segmenta- 
tion of, the unit patterns, a number of hypothetical 

30 unit patterns are provided for the recognition unit 
for selecting one of the hypothetical unit patterns. 

Japanese Patent No. 62-24382 entitled 
"Method for Recognizing Handwritten Character" 
describes, in the abstract, a method for recognizing 

35 handwritten characters in which input strokes are 
compared with reference strokes stored in a tem- 
plate memory. 

Japanese Patent No. 61-68678 entitled "Optical 
Input Device of Drawing" describes an optical 

40 drawing input device where a symbol on a tem- 
plate is traced and compared to a registered sym- 
bol. 

What is not taught by this prior art, and what is 
thus an object of the invention to provide, is meth- 

45 od and apparatus for improving or optimizing a set 
of character prototypes, especially prototypes of 
character pairs that are known to be readily con- 
fusable one with the other. 

Preferably the invention is to provide a method, 

so and a means for accomplishing the method, that 
processes a pair of readily conf usable character 
prototypes so as to emphasize the differences be- 
tween the characters while also deemphasizing the 
similarities between the characters. 

55 This and other objects are advantageously 

solved basically by applying the features laid down 
in the independent claims. 

The invention provides for a method of creating 
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character prototypes for improving a recognition 
accuracy of similarly-shaped characters. The meth- 
od of the invention operates to emphasize those 
portions of similarly shaped characters that are 
known to be critical for discriminating between the 
similarly shaped characters and to de-emphasize 
those character portions that are found not to be 
critical for discrimination "purposes. 

One method includes a step of entering a 
character to form a character prototype, the writer 
writing those portions of the character that are 
critical for discrimination in a larger fashion than is 
normal while writing the unimportant features small- 
er or, for characters having a same feature in 
common, writing the unimportant feature identically 
for each of a pair of characters to be discriminated. 

In the creation of a starter set of character 
prototypes graphic techniques are employed to 
make the unimportant portions of similar characters 
identical. 

Another method employs a graphics tool to 
copy a trace of a first character. The copied trace 
is then modified, at a portion critical for differentiat- 
ing the first character from a confusable second 
character, to convert the first character to the sec- 
ond character. This technique ensures that similarly 
formed portions of the two characters are iden- 
tically formed and that a portion or portions, impor- 
tant for discrimination, are differently formed and 
are emphasized. 

A further method involves an automated cari- 
cature process. A first step chooses similar-char- 
acter prototypes to exaggerate. A second step 
matches similar character prototypes on a point-to- 
point basis. A third step computes modified proto- 
types composed of point pairs that are computed 
by displacing corresponding pairs of points away 
from one another by a distance that is proportional 
to an original distance between the point pairs. This 
method automatically makes the shapes of the two 
prototype characters less similar by emphasizing 
the dissimilar regions. 

The above set forth and other features of the 
invention are made more apparent in the ensuing 
detailed description of the invention when read in 
conjunction with the attached drawing, wherein: 
Fig. 1 is a simplified block diagram of a 
handwriting recognition system con- 
structed and operated in accordance 
with the invention; 
Fig. 2 illustrates, by example, two prototype 
characters associated with a readily 
confusable character pair and a plural- 
ity of differently formed input char- 
acters that may be misrecognized; 
Fig. 3 illustrates, by example, the prototype 
characters of Fig. 2 being formed so 
as to emphasize the differences be- 
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tween the characters and to deem- 

phasize the similarities; 
Fig. 4 illustrates, by example, two different 

character prototype pairs, specifically 
5 uppercase w A n and "H" and lower 

case "h" and "k"; 
Fig. 5 illustrates the two different character 

prototype pairs of Fig. 4 that are 

formed in such as manner as to em- 
70 phasize the differences between the 

characters and to deemphasize the 

similarities; 

Fig. 6 is a flowchart depicting a first method 
of the invention and also the effect of 
75 the operation of the method on the 

formation of an exemplary pair of 
character prototypes; and 
Fig. 7 is a flowchart depicting a second 
method of the invention and also the 
20 effect of the operation of the method 

on the formation of an exemplary pair 
of character prototypes. 
Referring to Fig. 1 there is shown an on-line 
handwriting character recognition system 10 con- 
25 structed and operated in accordance with the in- 
vention. System 10 includes a handwriting trans- 
ducer comprised of an electronic tablet 12 having a 
writing implement, such as a stylus 14. In response 
to handwritten characters the transducer outputs a 
30 serial stream of (x.y) coordinate pairs correspond- 
ing to a position of the stylus 14 relative to a tablet . 
12 coordinate system. The coordinate pairs are 
provided to a stroke segmenter 16 that identifies a 
beginning and an end of discrete handwritten 
35 strokes. Strokes are provided to a character match- 
er 18. Character matcher 18 is coupled to a store 
20 of predetermined character prototypes for com- 
paring points of input strokes to points of stored 
character prototypes so as to identify, to within 
40 some probability, that an input stroke or assem- 
blage of strokes represents a specific character. 
Recognized characters are output from the matcher 
18 and may be displayed to a user or input to an 
application, such as a word processing program. In 
45 some systems the functionality of the segmenter 
16 and the character matcher 18 may be combined 
within one handwritten stroke processing unit. 

Character prototypes may, be provided to the 
store 20 during a training session held with a user 
so of the system 10. The character prototypes may be 
entered by a specific user of the system and 
thereafter used only when the specific user enters 
handwritten characters. Also, a global or universal 
set of character prototypes may be generated for 
55 use with more than one user. 

In this regard a starter prototype set may be 
designed as an initial prototype set to be used by 
any writer on first using the system 10. If desired, a 
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writer may then add character prototypes in his/her 
own writing style to obtain a prototype set tailored 
to the writer, also referred to as an individualized 
prototype set. In that a starter prototype set is 
typically created by an expert in handwriting rec- 
ognition systems, there is no burden placed on a 
user of the system 10 to follow any writing sugges- 
tions or constraints. 

In any case, during a training session char- 
acters from character matcher 18 are routed 
through a physical or logical switch 22 to a proto- 
type processor 24. Prototype processor 24 is also 
coupled to a user input and output device, such as 
a keyboard 26 and a display 28. for controlling the 
operation thereof as will be described in detail 
below. The output of prototype processor 24 is a 
series of prototype characters that are supplied to 
store 20 for subsequent use during a handwriting 
recognition session conducted with a user of the 
system 10. 

In accordance with the invention there are pro- 
vided methods of operating the prototype proces- 
sor 24 so as to create character prototypes for 
improving the recognition accuracy of similarly- 
shaped characters. In general, the methods of the 
invention operate to emphasize those portions of 
similarly shaped characters that are found to be 
critical for discriminating between the similarly 
shaped characters and to deemphasize those por- 
tions that are found not to be critical for discrimina- 
tion purposes. 

A first method includes a first step of drawing 
one character of a pair of confusable characters, 
such as the character "g" of the confusable pair 
n g,y". The writer intentionally draws those portions 
of the character that are critical for discrimination in 
a larger fashion than is normal, while drawing the 
unimportant features smaller or, for characters hav- 
ing a same feature in common, drawing the unim- 
portant feature identically for each of a pair of 
characters to be discriminated. 

For example^ and referring to Figs. 2 and 3, the 
descenders, or tails, of the "g" and "y" are not 
generally considered to be important for the "g-y" 
discrimination, and the writer is advised to draw 
them identically so that an unknown character is 
assigned the same score, by the character matcher 
18, when matching against that portion of the pro- 
totype characters. However, the upper portion of 
each of these characters, specifically the closure of 
the "g" and the open portion of the "y", are 
considered important for recognition, and the writer 
is advised to emphasize these elements of the 
characters, as in Fig. 3. 

Prototypes that are created in accordance with 
this method exhibit a greater percentage of char- 
acter curves that are devoted to features important 
for discrimination. For example, recognition of up- 



percase "O" versus lowercase "o" has been found 
- to improve when the difference in size (diameter) is 
increased for the upper case "O". Also, and as is 
seen in Figs. 4 and 5, "A" versus "H" discrimina- 
5 tion was found to improve when the vertical strokes 
of the "H" were separated more at the top than at 
the bottom, so as to exaggerate that feature that 
best distinguishes "H" from "A". Also, discrimina- 
tion between cursive lowercase "h" and "k" was 
w found to improve markedly when the distinguishing 
features in these prototypes were drawn in exag- 
gerated fashion. For example, the lower curved 
portion of the "h" is widened while the cusp of the 
"k" is intentionally drawn to the left so that it 
75 extends past the vertical stroke. 

This method has been successfully employed 
with both starter and individualized prototype sets. 
In the creation of a starter set, graphic techniques 
are preferably employed to make the unimportant 
20 portions of similar characters identical. 

Referring to Fig. 6 there is illustrated in 
flowchart form a further method of the invention. 
This method employs a graphics tool, such as a 
graphics software package known in the art as 
25 Freelance, to copy the trace of a character and 
then to graphically modify the important portion so 
as to -convert it to the other character of the 
confusable-character pair. 

In a first step (A) a user draws one of the 
30 characters of the confusable character pair, by 
example a "g" of the pair "g,y n . 

In a second step (B) the input character is 
filtered, using the graphics software package, by 
moving x-y coordinate points of the drawn char- 
35 acter to smooth lines, remove aberrant points, and 
make the character more regular in shape. 

In a third step (C) the filtered first character is 
copied to form a second character. In a fourth step 
(D) unimportant portions of the first and second 
40 characters are made identical, such/as the de- 
scenders of the g and y. Also, by manually moving 
points of one or both of the characters the differ- 
ences between the important features are exag- 
gerated, such as the exaggeration of the closure of 
45 the upper portion of the "g" versus the upper, open 
portion of the "y n . 

At step E the two prototype characters are 
stored in the store 20 for subsequent use. 

A further method, illustrated in Fig. 7, employs 
so an automated caricature process to exaggerate the 
critical differences between handwritten character 
prototypes. Step A selects a pair of similar-char- 
acter prototypes, such as "h,k". At step B the 
similar-character prototypes are matched on a 
55 point-to-point basis. Preferably, this matching is 
accomplished by the character matcher 18 through 
the use of an elastic matching algorithm of the type 
discussed by C.C. Tappert in the aforementioned 
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journal article entitled "Adaptive On-Line Handwrit- 
ing Recognition". The elastic matcher operates to 
treat one of the characters of the pair as an un- 
known and the other as a prototype. The char- 
acters are elastically deformed so as to achieve a 
best-fit, characterized as a lowest score, between 
the characters. This process also results in an 
alignment of the two characters such that cor- 
responding pairs of points, one from each char- 
acter, are associated with one another for subse- 
quent processing. 

At step C the two similar-character prototypes 
are processed to move points, in a manner de- 
scribed below, so as to emphasize differences and 
de-emphasize similarities between the two char- 
acters. At step D the generated similar-character 
prototypes are stored in the store 20 for subse- 
quent use. 

Steps B and C may be expressed as a com- 
putation executed by the prototype processor 24 
as described below. 

To match similar-character prototypes .1 and 2 
on a point-to-point basis perform the following 
steps (a)-(d), for all pairs of corresponding points 
(P1.P2): 

(a) let p1 = the point in prototype 1; 

(b) let p2 = the point in prototype 2; 

(c) denote a vector drawn from p1 to p2 as (p2- 

pi>; 

(d) compute modified prototypes V and 2\ com- 
prised of point pairs pi' and p2\ respectively, 
such that 

pr = p1 + k(p1-p2);and 
p2' = p2 + k(p2- P 1), ■ 

using vector addition, with scalar k > 0 chosen 
appropriately. 

This technique operates to move correspond- 
ing pairs of points (p1 ,p2) away from one other by 
a distance proportional to their original distance. 
This serves to make the shapes of the two proto- 
type characters less similar by emphasizing their 
dissimilar regions. The scalar k is a caricaturization 
factor that controls the degree of distortion to cre- 
ate the caricature versions of the writing. 

Claims 

1. A method of providing character prototypes for 
use by a handwriting character recognition 
system, comprising the steps of: 

providing a first handwritten character; 

copying the first character to provide a second 
character; and 



modifying the second character to (a) empha- 
size dissimilarities between the second char- 
acter and the first character and to (b) de- 
emphasize similarities between the second 
5 character and the first character. 

2. A method as set forth in claim 1 and further 
including a step of storing the first character 
and the modified second character within a 

io store of character prototypes (20). 

3. A method as set forth in claim 1 or 2 wherein 
the first character is represented as a set of 
points referenced to a two-dimensional coordi- 

75 nate system and wherein the step of copying 

includes an initial step of processing the first 
character to remove aberrant points from the 
first character. 

20 4. A method of providing character prototypes for 
use by a handwriting character recognition 
system, comprising the steps of : 

providing a first handwritten character and a 
25 second handwritten character, the first and the 

second handwritten characters each being 
comprised of a plurality of points; 

matching corresponding points between the 
30 first character and the second character; and 

processing pairs of matching points to sepa- 
rate one from another each point of the match- 
ing pair by an amount proportional to an initial 

35 separation between each matched correspond- 

ing point of the pair, so as to emphasize dis- 
similarities between the first character and the 
second character, preferably storing the pro- 
cessed first character and the processed sec- 

40 ond character within a store of character proto- 

types. 

5. A method as set forth in claim 4 wherein the 
step of matching includes a step of elastically 

45 matching the first character and the second 

character to align the first character and the 
second character. 

6. A method as set forth in claim 4 or 5 wherein 
so step of processing includes, for all pairs of 

corresponding points (p1 ,p2), the steps of: 

(a) let p1 = the point in the first character; 

(b) let p2 = the point in the second char- 
acter; 

55 (c) denote a vector drawn from p1 to p2 as 

(p2-p1); and 

(d) compute, using vector addition, a modi- 
fied first character and a modified second 
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character, comprised of point pairs pV and 
p2\ respectively, such that 



zero. 



p1' = p1 + k(p1-p2); and 
p2' = p2 + k(p2-p1), 



5 



11. Apparatus as set forth in claim 10 wherein k is 
a caricaturization factor having a value select- 
ed to control an amount of distortion of the first 
character and the second character. 



where k is chosen to be greater than zero. 

7. A method as set forth in claim 6 wherein k is a 
caricaturization factor having a value selected 10 
to control an amount of distortion of the first 
character and the second character. 

8. Apparatus for providing character prototypes 

for use by a handwriting character recognition 75 
system, comprising: 

means (12, 14) for providing a first handwritten 
character and a second handwritten character, 
the first and the second handwritten characters 20 
each being comprised of a plurality of points; 

means (18) for matching corresponding points 
between the first character and the second 
character; and 25 

means (24) for processing pairs of matching 
points to separate one from another each point 
of the matching pair by an amount proportional 
to an initial separation between each matched 30 
corresponding point of the pair so as to em- 
phasize dissimilarities between the first char- 
acter and the second character, said apparatus 
preferably including means for storing the pro- 
cessed first character and the processed sec- 35 
ond character. 

9. Apparatus as set forth in claim 8 wherein the 
matching means includes means for elastically 
matching the first character and the second 40 
character to align the first character and the 
second character. 

10. Apparatus as set forth in claim 8 or 9 wherein 
processing means includes means, responsive 45 
to all pairs of corresponding points (p1,p2), 
where p1 = the point in the first character, 
where p2 = the point in the second character, 

and where (p2-pl) denotes a vector drawn 
from pi to p2, for computing a modified first 50 
character and a modified second character, 
comprised of point pairs pV and p2\ respec- 
tively, such that 

pr = p1 + k(p1-p2); and 55 
p2' = p2 + k(p2- P 1), 

where k is a scalar chosen to be greater than 
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© A method, and apparatus for accomplishing 
same, of providing character prototypes for use by a 
handwriting character recognition system 10. A first 
method includes a first step of providing a first 
handwritten character; a second step of copying the 
first character to provide a second character; and a 
00 third step of modifying the second character to (a) 
^ emphasize dissimilarities between the second char- 
O acter and the first character and to (b) de-emphasize 
lf> similarities between the second character and the 
^ first character. A second method includes a first step 
CO °f providing a first handwritten character and a sec- 
O ond handwritten, character, the first and the second 
^ handwritten characters each being comprised of a 
O plurality of points. A second step of the method 
n matches corresponding points between the first 
in character and the second character. A third step of 
the method processes pairs of matching points to 
separate one from another each point of the match- 



ing pair by an amount proportional to an initial sepa- 
ration between each matched corresponding point, 
so as to emphasize dissimilarities between the first 
character and the second character. 
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